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SURFACE PLASMON RESONANCE BASED ENZYMATIC ASSAY 
5 Background of the I nvention 

High-throughput assays for screening potential inhibitors of 
proteases are known. An example of such an assay is the scintillation 
proximity assay (SPA). SPA technology involves the use of beads coated 
1 0 with scintillant Bound to the beads are acceptor molecules such as 

antibodies, receptors or enzyme substrates which interact with ligands or 
enzymes in a reversible manner. 

For a typical protease assay the substrate peptide is biotinylated at 

1 5 one end and the other end is radiolabeled with low energy emitters such 

as 125 I or 3 H. The labeled substrate is then incubated with the enzyme. 
Avidin coated SPA beads are then added which bind to the biotin. When 
the substrate peptide is cleaved by the protease, the radioactive emitter is 
no longer in proximity to the scintillant bead and no light emission takes 

2 0 place. Inhibitors of the protease will leave the substrate intact and can be 

identified by the resulting light emission which takes place in their 
presence. 

The SPA assay works well. However, labeling of the substrate could 

2 5 result in inactivation of the substrate. In addition, radiolabeled emitters 

may pose both health and environmental concern. Therefore there is a 
need for producing a high-throughput assay which does not require the 
use of radioactive substances. 

30 

Summary of the Invention 

The present invention fills this need by providing for a method for 

3 5 determining if a substrate is cleaved by an enzyme using surface plasmon 

resonance. According to the process of the present invention, the 
substrate and the protease are placed together in solution in a reaction 
vessel under conditions wherein the protease can cleave the substrate after 
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which the reaction is stopped. The solution containing the protease and 
substrate are then brought into contact with a ligand bound to a sensor 
chip, wherein the ligand is able to bind to the substrate, and wherein the 
mass of the intact substrate versus the mass of the cleaved substrate is 
5 detected by surface plasmon resonance technology. 

The present invention is further comprised of a method of 
determining if a test substance is a protease inhibitor. The test substance is 
placed in solution in the reaction vessel with the protease and the 
1 0 substrate. The solution containing the protease, substrate and test 

substance are then brought into contact with a ligand bound to a sensor 
chip, wherein the ligand is able to bind to the substrate, and wherein the 
mass of the substrate versus the mass of the cleaved substrate is detected by 
surface plasmon resonance technology. If the substrate is cleaved as 

1 5 determined by its decrease in mass as detected by surface plasmon 

resonance technology then the test substance is not a protease inhibitor. 
On the other hand if the substrate is a protease inhibitor, then the 
substrate does not have a decrease in mass as is determined by surface 
plasmon resonance. 

20 

Detailed Description of the Invention 

A novel, high throughput enzymatic assay utilizing the surface 
plasmon resonance technology (SPR) has been successfully developed. 

2 5 Using this assay, and a dedicated BIAcore™ instrument, at least 1000 

samples per week can be screened for either their enzymatic activity or 
their inhibitory effects toward the enzymatic activity, in a 96 well plate 
format. This methodology is readily adaptable to any enzyme-substrate 
reaction. We have successfully used this methodology to develop high 

3 0 throughput assays for CMV and HCV proteases. The advantage of this 

assay over the currently available SPA assay is that it does not require a 
radiolabeled peptide substrate. 

3 5 BIAcore™ is a processing unit for Biospecific Interaction Analysis. 

The processing unit integrates an optical detection system with an 
autosampler and a microfluidic system. BIAcore™ uses the optical 
phenomena, surface plasmon resonance to monitor interaction between 



WO 96/35806 



3 



PCTYUS96/06385 



biomolecules. SPR is a resonance phenomenon between incoming 
photons and electrons on the surface of thin metal film. Resonance occurs 
at a sharply defined angle of incident light. At this angle, called the 
resonance angle, energy is transferred to the electrons in the metal film, 
5 resulting in a decreased intensity of the reflected light, SPR response 

depends on a change in refractive index in the close vicinity of the sensor 
chip surface, and is proportional to the mass of analyte bound to the 
surface, BIAcore continuously measure the resonance angle by a relative 
scale of resonance units (RU) and displays it as an SPR signal in a 
1 0 sensorgram, where RU are plotted as a function of time. 

In addition, BIAcore™ uses continuous flow technology. One 
interactant is immobilized irreversibly on the sensor chip, comprising of a 
non-crosslinked carboxymethylated dextran providing a hydrophilic 

1 5 environment for bimolecular interaction. Solution containing the other 

interactant flow continuously over the sensor chip surface. As molecules 
from the solution bind to the immobilized ligand, the resonance angle 
changes resulting in a signal registered by the instrument. 

2 0 In contrast to many current high throughput assays based on 

technologies that require radiolabeling of one of the reactants studied, in 
BIAcore™ no need for radiolabelling exists. This important advantage 
allows for development of assays for studying reactions where labeling of 
one of the component is either not feasible or would interfere with the 

2 5 true interaction of biomolecules targeted for screening. 

Since the commercial availability of the BIAcore™, the surface 
plasmon resonance technology has been extensively and primarily been 
used to study the interaction of biomolecules i.e. protein-protein, protein- 

3 0 DNA, protein-antibody, peptide-antibody, etc. in real time. To our 

knowledge, there are currently no published methodologies for use of the 
SPR technology, specifically in a high throughput fashion for following 
the extent of enzyme-substrate reaction, determination of catalytic activity 
of enzymes, determination of enzyme kinetics and ultimately screening 
3 5 for inhibitors of enzymatic activity. Here, we disclose a novel 

methodology which when used in conjunction with SPR technology 
allows for rapid screening of inhibitors of enzymatic activity. In this 
methodology, the enzymatic reactions are carried out outside of the 
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BIAcore, i.e. in reaction tubes or 96-well tissue culture plates, as it is 
conventionally done for any of the currently available high throughput 
assays. The SPR is only used as a detection means for determination of the 
amount of an intact substrate remaining in a solution with and without 
5 the enzyme after the reaction is quenched. 

In order to measure the amount of the intact substrate prior to the 
addition of enzyme, a means of capturing the substrate onto the sensor 
chip had to be established. In addition, to satisfy the requirement for a 

1 0 high throughput assay on the BIAcore, the substrate needed to be removed 

from the surface subsequent to completion of analysis. This is required 
since the same surface will be used for the subsequent reactions. To 
accomplish these two requirements, a phosphotyrosine is synthetically 
attached to one end of the substrate. The phosphotyrosine was chosen due 
15 to the commercial availability of an anti-phosphotyrosine monoclonal 
antibody. The antibody is covalently attached to the sensor chip by 
standard amine coupling chemistry. The anti-phosphotyrosine antibody, 
bound permanently to the chip is used to capture the phosphotyrosine- 
containing substrate in a reversible manner. The antibody- 

2 0 phosphotyrosine interaction is ultimately used to capture and release the 

peptide substrate when desired by regeneration of the surface with various 
reagents i.e. 2 M MgCl2- 

Introduction of the intact peptide onto the antibody surface results 

2 5 in a larger mass which is detected by the instrument. To follow the extent 

of peptide cleavage, a mixture of peptide substrate and enzyme is 
incubated for the desired time and then quenched. Introduction of this 
mixture containing the cleaved peptide and the intact peptide to a 
regenerated antibody surface results in a lower mass value than that 

3 0 detected for a sample containing only intact peptide. The difference in the 

two values is then used to calculate the exact amount of intact peptide 
remaining after cleavage by the enzyme. 

Although the reduction in mass can be directly followed with many 
3 5 large substrates, due to the small mass of a typical synthetic peptide 

substrate (10-20 amino acids, 1-3 Daltons), the mass difference, and thus 
the signal difference between the intact and cleaved peptide is very small 
within the signal to noise ratio of the instrument. To circumvent this low 
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sensitivity, we synthesized a biotin molecule on the N-terminal of the 
peptide. By addition and thus tagging of peptide with streptavidin prior to 
injection of tagged peptide onto the antibody surface of the chip, the signal 
due to the presence of streptavidin will be higher. Using this approach, a 
5 cleaved peptide lacking the N-terminal half, tagged with streptavidin will 
result in a much lower signal. 

Figures 8 and 9 shows a schematic of a sample receptacle 10 of a 
surface plasmon resonance instrument . In sample container 12 is a 
1 0 sensor chip 16. Attached to sensor chip 16 is antibody 18 which is bound to 
substrate 20. 

The following examples are included to illustrate the present 
invention using hepatitis C protease and its substrates and the 
1 5 cytomegalovirus protease and its substrates. 

Examples 1 



2 0 Surface Pla smon Resonance Assay 

The present example illustrates a method for determining if a 
compound can be useful as an HCV protease inhibitor using the surface 
plasmon resonance assay. Figures 8A, 8B, 9A and 9B. illustrate the 

2 5 technique. 

Procedure for Coupling Anti-phosphotyrosine Mab to the Sensor Chip 

The anti-phosphotyrosine Mab is coupled to the carboxymethylated 

3 0 dextran surface of a sensor chip in the following manner. The flow rate 

used throughout the coupling procedure is 5 ^il/min. The surface is first 
activated with a 35 jil injection of NHS/EDC (N-hydroxysuccinimide/N- 
dimethyllaminopropyl-N'-ethylcarbodiimide-HCl). This is followed by a 
40 ml injection of Mab 4G10 at 50 ^ig/ml in 10 mM sodium acetate buffer, 
3 5 pH = 4.0. Any remaining activated esters are then blocked by the injection 
of 35 pJ of 1 M ethanolamine. These conditions result in the 
immobilization of approximately 7,500 response units (420 |iM) of 
antibody. 
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Binding of Peptide and Regeneration of Mab 4G10 Surface 

The flow rate used throughout the BIAcore analysis run is 5 
5 |il/min. A 4 injection containing streptavidin-tagged peptide (peptide 
concentration at 2jiM, streptavidin binding sites concentration at 9jiM) is 
carried out. The amount of streptavidin-tagged peptide bound to the 
antibody surface (in response units) is measured 30 seconds after the 
injection is complete. 

1 0 

Regeneration of sensor chip surface 

Regeneration of the Mab 4G10 surface is achieved using a 4 |il pulse 
of 2 M MgCl2 after each peptide injection. Surfaces regenerated up to 500 

1 5 times still showed 100% binding of tagged peptide. 

Determination of the Optimal Concentration of Peptide and Streptavidin 

To determine the optimal peptide concentration, a standard curve 

2 0 was generated using various amounts of peptide (0-10 ]xM) in the presence 

of excess streptavidin. A value in the linear range, 2 |iM, was chosen for 
standard assay conditions. 

The amount of streptavidin required to completely tag the peptide 

2 5 was determined using a peptide concentration of 2.5 \iM and titrating the 

amount of streptavidin (|iM of binding sites). All the peptides were 
shown to be completely tagged when streptavidin concentrations greater 
than 3 (iM (approximately equimolar to the peptide concentration) were 
used. A streptavidin concentration of 9 (iM (a 4,5 fold excess) was chosen 

3 0 for standard assay conditions. 



Application of Described Methodology to HCV Protease 

35 

The HCV protease 5A/5B peptide substrate, 
DTEDWACSMSYTWTGK (SEQ ID NO 18), with phophotyrosine at the C- 
terminal and biotin at the N-terminal is synthesized. Anti- 



9635806A1J_> 



WO 96/35806 



7 



PCTAJS96/06385 



phosphotyrosine monoclonal antibody, 4G10 was coupled to the sensor 
chip. 

In the absence of HCV protease, the intact streptavidin-tagged 
5 biotinylated phosphotyrosine peptide results in a large signal (large mass 
unit/large response units) through its interaction with the anti- 
phosphotyrosine monoclonal antibody. 

The protease-catalyzed hydrolysis of the phosphotyrosine- 
1 0 biotinylated peptide was carried out in a 96 well plate. The reaction was 
stopped with an equal volume of the quenching buffer containing 
mercuribenzoate. Streptavidin was added to tag the peptide which binds 
to the biotin. The cleaved peptide which lacks the tagged streptavidin (less 
mass) results in the loss of response units. 

1 5 

Using this assay, numerous compounds can be tested for their 
inhibitory activity since the antibody surface can be regenerated 
repetitively with 2 M MgCl2- 

2 0 The peptide cleavage activity by HCV protease can be monitored in 

a time dependent manner using the BIAcore-based methodology. 
Using the concentrated enzyme and the BIAcore substrate, 
EiOtin-DTEDWAC SMSYTWTGK-pY (SEQ ID NO 17), 50% substrate 
cleavage is achieved within 1 hour using the BIAcore-based HCV assay. 

2 5 Based on the amount of enzyme, His-NS3(183)A4AHT needed to reach a 

50% cleavage within 2 hours, a time scale desired for a development of a 
high throughput assay, we estimate that 1 liter of fermentation of the His- 
NS3(183)A4AHT construct results in enough protease to run at least 100 
reactions on the BIAcore. 

30 

Standard Operating Procedure for BIAcore-based HCV Assay 

Reactions are prepared in a 96-well tissue culture plate using the 
Reaction Buffer (50 mM HEPES, pH 7.4, 20 % glycerol, 150 mM NaCl, ImM 

3 5 EDTA, 0.1% Tween-20,1 mM DTT ) as diluent. The final reaction volume 

is 100 Sample with the peptide alone (Biotin-DTEDWAC 
SMSYTWTGKpY) is prepared by addition of 10 \xl of peptide stock at 100 
^iM (prepared in the reaction buffer) to 90 ^1 of reaction buffer, so that the 
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final concentration of peptide is 10 jiM. Samples comprised of peptide and 
the enzyme are prepared by addition of 10 pi of peptide stock at 100 jiM 
and 10 |jJ of partially purified His-NS3 (183)-A4A-HT stock at 1.7 mg/ml 
(both prepared in the reaction buffer) to 80 fil of reaction buffer, so that the 
5 final concentration of peptide and the enzyme is 10 and 0.1 |iM 

respectively. The reaction is held at 30°C for the specified time and then 
quenched. Quenching is achieved by transferring a 20-^il aliquot of the 
reaction mixture to a new tissue culture plate containing an equal volume 
of PMB Quenching Buffer (50 mM HEPES, pH 7.8, 150 mM Nad, 5 mM P- 

1 0 Hydroxymercuribenzoic Acid, and 13 mM EDTA). 

To prepare the quenched reactior mixture for injection onto the 
sensor surface, 30 jal PMB BIAcore Buffer (50 mM HEPES, pH 7.4, 1 M 
NaCl) and 30 |il of streptavidin at 0.5 mg/ml in water is added to the 40 pi 
15 of the quenched reaction mixture to a final volume of 100 pi. In this step, 
all the peptides are tagged with streptavidin prior to the injection of 
samples. Finally, 4 |ol of this sample is injected over the 
antiphosphotyrosine surface for determination of the intact versus 
cleaved peptide. The final concentration of peptide and the streptavidin 

2 0 in the BIAcore sample is 2 and 9 (iM respectively. 

Experimental Conditions: 



25 Substrate: 



BiQlin-DTED WAC SMSYTWTGK-gX (SEQ ID 
NO 19) in Reaction buffer without DTT 



Concentration: 



170 jiM (Crude peptide, based on weight) 



3 0 Enzyme: 



10 ill of concentrated His-NS3 (183)-A4A-HT 
at 1.7 mg/ml 



Reaction volume: 



100 fil 



3 5 
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Reaction buffer: 



50 mM HEPES, pH 7.8 
20 % glycerol 
150 mM NaCl 
ImM EDTA 
ImM DTT 
0.1% Tween-20 



1 0 



Temp: 

Quench with: 



30° C 



p-hydroxymercuribenzoate 



Example 2 



1 5 Standard Operating Procedure for surface plasmon resonance CMV Assay 



Reactions are prepared in a 96-well tissue culture plate using the 
Reaction Buffer (50 mM HEPES, pH = 7.4 , 25% Glycerol 1 mM DTT ) as 
2 0 diluent. The final reaction volume is 100 |iL. Sample with the peptide 
alone (Biotin-RGWNA SCRLAKY (SEQ ID NO: 31) is prepared by 
addition of 10 pil of peptide stock at 100 mM (prepared in the reaction 
buffer) to 90 |il of reaction buffer, so that the final concentration of peptide 
is 10 jiM. Samples comprised of peptide and the enzyme are prepared by 

2 5 addition of 10 |il of peptide stock at 100 |iM and 10 |il of enzyme stock at 1 

mM (both prepared in the reaction buffer) to 80 yd of reaction buffer, so 
that the final concentration of peptide and the enzyme is 10 and 0.1 \*M 
respectively. The reaction is held at 25° C for the specified time and then 
quenched. Quenching is achieved by transferring a 20-^ aliquot of the 

3 0 reaction mixture to a new tissue culture plate containing an equal volume 

of PMB Quenching Buffer (50 mM HEPES, pH 7.8, 150 mM NaCl, 5 mM p- 
Hydroxymercuribenzoic Acid, and 13 mM EDTA). 

To prepare the quenched reaction mixture for injection onto the 
3 5 sensor surface, 30 ^il PMB BIAcore Buffer (50 mM HEPES, pH 7.4, 1 M 

NaCl) and 30 |il of streptavidin at 0.5 mg/ml in water is added to the 40 \sl 
of the quenched reaction mixture to a final volume of 100 fil. In this step, 
all the peptides are tagged with streptavidin prior to the injection of 
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samples. Finally, 4 pi of this sample is injected over the 
antiphosphotyrosine surface for determination of the intact versus 
cleaved peptide. The final concentration of peptide and the streptavidih 
in the BIAcore sample is 2 and 9 uM respectively. 

5 

Applications of Descr ibed Methodology to CMV Protease 
I. Comparison of HPLC and BIAcore Methodology 

10 

A set of CMV protease reaction samples were analyzed by both 
standard HPLC methodology and the above described BIAcore based 
method. Values for the amount of intact peptide substrate remaining at 
each time point were determined using both methods. The extent of 
1 5 enzyme catalysis as measured by the BIAcore method were identical to 

that measured by the standard HPLC method typically used in the field of 
enzyme catalysis. 

HPLC Substrate : A-G-V-V-N-A-S-S-R-L-A (SEQ ID NO: 32) 

20 

BIAcore Substrate : (Bioun)-R-G-V-V-N-A-S-S-R-L-A-K-(pY) (SEQ ID NO: 
31) 

Substrate ronrpntrarion: 100 uM 

25 

Enzyme: 

CMV Protease See Baum, Ellen et al, }. Virology January 1993 pages 497- 
506. 

3 0 A. Refolded wild type: 0.5 \iM 

Time points : 10, 20, 30, 45, 70, and 100 min 

Temperature: 25° C 

35 
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Reaction c onditions: 

50 mM HEPES, pH 7.5 
5 150 mM NaCl 
1 mM DTT 
25% glycerol 

II. Determination of kinetic parame ters for the hydrolysis of CMV 
1 0 maturation site subst rate by various human CMV protease 

forms 



HCMV protease activity was determined by monitoring the 
cleavage of the peptide using the BIAcore-based methodology as described. 

1 5 The concentration of peptides used for the calculation of kinetics 

parameters was determined .by amino acid composition analysis. 12 
different concentrations of substrate, from 10 uM to 3.2 mM were used for 
km calculation (see below). Hydrolysis at each peptide concentration was 
monitored at six different time points (see below). A total of 96 samples 

2 0 were processed in a 96 well-plate format in 12 hours for determination of 

kinetic parameters for each enzyme. Kinetic parameters (km, V m ax, Kcat) 
were determined by fitting directly the velocity (initial rates at < 15% of the 
total substrate hydrolysis) versus substrate concentration data to Michaelis- 
Menton equation using both BIAevaluation program (Pharmecia 
2 5 Biosensor) and K.cat program (BioMetallics, Inc.). 



Substrate : A-G-V-V-N-A-S-S-R-L-A (SEQ ID NO: 32) 
3 0 Enzyme co ncentration: 
CMV Protease 

A. Refolded wild type (SPA batch #35187-81): 0.2 uM 

B. soluble A143V: 0.1 uM 

35 

C soluble A143V/V209A: 0.1 uM 
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Substrate concentration: 

10, 13, 16, 22, 35, 60, 110, 210, 410, 810, 1610, 3210 |iM 

5 Time points : 1, 5, 10, 15, 20, 25 min 

Temperature: 22° C 

Reaction conditions: 
10 50 mM HEPES, pH 7.5 
150 mM NaCl 
1 mM DTT 
25% glycerol 

15 m. Validation of Enzyme Kinetics and Inhibition Determined Using 
BIAcore-based CMV Assay 

A. Effect of Enzyme Concentration and Time of Reaction on Cleavage 
2 0 of Native Peptide Substrate 

Experimental Conditions : 

Native Peptide: (Biotin)- R-G-V-V-N-A-S-C-R-L-A-(pY) (SEQ ID NO: 

2 5 32) 

Final Concentration of peptide: 10 uM 

Final Enzyme (A143V/V209A): variable 
Time course: variable 
Final DTT: 1 mM 

30 

Summary: 

Using the native substrate, cleavage of the substrate is linear with 
respect to protease concentration (within the 0.02 to 0.08 |xM range) and 

3 5 time (within 1 to 4 hour range). A standard BIAcore-based CMV assay 

using the native substrate is carried out for 2 hours at 0.06 (iM resulting in 
50 % cleavage of the substrate. 
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B. Effect of Enzyme Concentration and Time of Reaction on Cleavage 
pf P2' Serin e Analog Substrate 

5 

Ex perimental Conditions : 

Serine analog: (Biotin)- R-G-V-V-N-A-S-S-R-L-A-(pY) 



Final Concentration of peptide: 10 

1 0 Final Enzyme (A143V/V209A): variable 

Time course: variable 

Final DTT: 1 mM 



Summary: 

1 5 

Using the serine sufrstrate, cleavage of the substrate is linear with 
respect to protease concentration (within the 0.1 to 0.5 |iM range) and time 
(within 1 to 4 hour range). A standard BIAcore-based CMV assay using the 
serine analog is carried out for 2 hours at 0.5 \iM resulting in 50 % cleavage 

2 0 of the substrate. 

C. Effect of Substrate Co ncentration on Rate of Hvdrolvsis 
Experimenta l Conditions: 

25 

Serine analog: (Biotin)- R-G-V-V-N-A-S-S-R-L-A-(pY) 
Final Concentration of peptide: variable 
Final Enzyme (A143V/V209A): 0.1 (iM 

Time course: 2 hours 

3 0 Final DTT 1 mM 

Summary: 

The rate of hydrolysis of CMV peptide substrate is linear with 
3 5 respect to substrate concentrations up to 60 |iM. The concentration of 
substrate used in a standard BIAcore-based assay is 10 jiM. 
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D. Effect of Dithiothreitol on CMV Protease Activity 
Experimentfll Condition?: 

5 Serine analog: (Biotin)- R-G-V-V-N-A-S-£-R-L-A-(pY) 
Final Concentration of peptide: 10 jiM 

Final Enzyme (A143V/V209A): 0.5 \iM 

Time course: variable (2 to 20 hours) 

Final DTT variable (0 to ImM) 

10 

Protease: 

1. Break cells in the presence of DTT 

2. Purify supernatant on Anion Exchange Column in the presence of 
DTT 

15 3. Purify protease on Phenyl Sepharose with and without DTT. 
Summary 

The CMV protease, purified in the absence of DTT lacks catalytic 
activity as judged in a 20- hour assay as compared to an active preparation, 

2 0 purified in the presence of DTT. Addition of DTT to the inactive protease 

results in a complete reactivation of the protease. Thus, presence of DTT 
in the CMV assay is essential for the catalytic activity of CMV protease . 

E. Minimum Requirem ent of DTT for CMV Proteas e Activity 

25 

Experimental Conditions: 

A stock preparation of CMV protease, purified entirely in the 
presence of DTT, and subsequently stored with DTT (final concentration = 

3 0 2 mM) was used. For the assay, titration of DTT from 800 (iM down to 4 

HM was carried out so that the final concentration of enzyme in all 
reaction was 0.04 |iM. All reactions were stopped at various times (1, 2, 3, 
and 4 hours) by addition of equal volume of 5 mM mercuribenzoate. 

35 
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Native Peptide: (Biotin)- R-G-V-V-N-A-S-C-R-L-A-(pY) 
Final Concentration of peptide: 10 uM 

Final Enzyme (A143V/V209A): 0.04 uM 

Time course: variable (1-4 hours) 

5 Final DTT: variable (4 - 800 |iM) 

Final free SH (enzyme and substrate) 10.2 jiM 



Summary: 

1 0 

A 200-fold reduction of DTT concentration from 800 fiM to 4 uM 
lowers the percent cleavage of the native substrate by only 12 % at 2 hours, 
a time period for a standard BIAcore assay resulting in a 50% cleavage of 
the substrate. This reduction in percent cleavage can easily be recovered 
15 by increasing the time of the reaction from 2 to 3 hours. Thus currently, 
the BIAcore based CMV assay can be carried out at DTT concentration as 
low as 4 uM. By increasing the time of the reaction, concentration of DTT 
may further be lowered if desired. 



20 

F. CMV Protease Inhibition Study with Compound 33277-129-2 
assayed in the pre sence of 4 and 800 uM DTT using BIAcore- 
based CMV Assay 



25 

Experimental Conditions: 

Native Peptide: 
(pY) 

3 0 Final Concentration of peptide: 
Final Enzyme (A143V/V209A): 
Time course: 
Final DTT: 
Compound: 

35 

Final concentration of compound: 
Final DMSO: 



(Biotin)- R-G-V-V-N-A-S-C-R-L-A- 

10 uM 
0.04 uM 

variable (1-4 hours) 
4 |lM or 800 uM 

33277-129-2 (Stock: 5.8 mM in 100% 

DMSO) 

12 or 116 uM 

2 % 
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Incubation: Preincubate compound with protease 

for 30 min; Add substrate; 
Incubate for 1-4 hours; Stop reaction. 

5 Summary: 

The compound 33277-129-2 is shown to inhibit the CMV protease 
activity by 40 % at ~ 12 |iM and 60 % at ~ 100 |iM in the presence of 4 ^iM 
DTT. The percent inhibition by this compound is lowered to 33 % at 
10 12 jiM and 40 % at 100 |iM in the presence of 800 mM DTT. 

IV. Competitive Inhibition of CM V Protease Activity by Maturation 

Site Peptide Analog? Monitored on BiAcore 

1 5 

The apparent affinity (km) of the purified double mutant form of 
CMV protease, A143V/V209A for the maturation site peptide, 
AG WNASSRLA (P2' serine analog) was determined to be 900 ^M at 
22° C using the BIAcore-based CMV assay (Table 1). 

20 

Various analogs of this substrate were then characterized in order to 
further define the amino acids crucial for hydrolysis versus those 
involved in enzyme binding. To address this issue, the interaction of 
native maturation site peptide (GWNASCRLA) and four of its mutant 

2 5 analogs with the enzyme were studied by both directly monitoring the 

hydrolysis of each peptide by the enzyme on the HPLC, and the 
competition of each peptide with the biotinylated native substrate using 
the CMV assay developed on the BIAcore. 

3 0 The HPLC results indicate that three out of four analogs tested 

(analogs A, C, D) were not hydrolyzed by the enzyme as monitored on 
HPLC. Two of the three non-hydrolyzable peptides (analogs A, D) were 
synthesized with only a single amino acid mutation where as analog C 
was synthesized so two of its amino acids were mutated to two non-native 
3 5 amino acids. 

The mechanism behind the loss of hydrolysis for analog A and C 
versus analog D was deciphered by the competition studies carried out 
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using the BIAcore-based CMV assay. The results of this assay show that 
the peptide substrate analogs A and C retain their binding ability to the 
enzyme but they are not hydrolyzed. These peptides inhibited the enzyme 
activity by competing with the biotinylated native substrate with a similar 
5 IC-50 to that of the native peptide (IC-50 native -1 mM; IC-50 mutants -1.5 

mM). On the other hand, the results of the competition assay indicate that 
the lack of any observed hydrolysis for the peptide substrate analog D (as 
monitored on HPLC) is due to the loss of its binding ability to the enzyme 
as this peptide is not capable of competing with the biotinylated native 
1 0 peptide as detected on the BIAcore. 

Table 1. 

1 5 APPARENT KINETIC PARAMETERS FOR THE VARIOUS 

RECOMBINANT HUMAN CMV PROTEASE CONSTRUCTS WITH THE 
MATURATION SITE PEPTIDE AGWNASSRLA 



CMV Protease 


Km(nM) 


k C at (min" 1 ) 


kcat/Km (min-l.uM -1 ) 


Refolded Wild 


1023 ± 123 


56 


0.055 


Type 








A143V Mutant 


962 ± 107 


63 


0.066 


A143V/V209A 


893 ±99 


73 


0.082 


Mutant 









20 Example 3 

Production of HC V NS3 Protease 
A. Plasmid constructions. 

25 

Several plasmids were designed and constructed using standard 
recombinant DNA techniques (Sambrook,Fritsch & Maniatis) to express 
the HCV protease in E. coli (Fig 2-7). All HCV specific sequences 
originated from the parental plasmid pBRTM/HCV 1-3011 (Grakoui et 
3 0 aZ.1993). To express the N-terminal 183 amino acid versions of the 

protease, a stop codon was inserted into the HCV genome using synthetic 
oligonucleotides (Fig. 3). The plasmids designed to express the N-terminal 
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246 amino acid residues were generated by the natural Ncol restriction site 
at the C-terminus. 

i) Construction of the plasmid pBJ1015 (Figure 2) 

5 

The plasmid pBRTM/HCV 1-3011 containing the entire HCV genome 
(Grakoui A., et at, /. Virol 67: 1385-1395) was digested with the restriction 
enzymes Sea I and Hpa I and the 7138 bp (base pair) DNA fragment was 
isolated and cloned to the Sma I site of pSP72 (Promega) to produce the 
1 0 plasmid,pRJ201* The plasmid pRJ 201 was digested with Msc I and the 2106 
bp Msc I fragment was isolated and cloned into the Sma I site of the 
plasmid pBD7. The resulting plasmid pMBM48 was digested with Kas I 
and Nco I, and the 734 bp DNA fragment after blunt ending with Klenow 
polymerase was isolated .and cloned into Nco I digested, klenow 

1 5 polymerase treated pTrc HIS B seq expression plasmid (Invitrogen). The 

ligation regenerated a Nco I site at the 5' end and Nsi I site at the 3' end of 
HCV sequence. The plasmid pTHB HCV NS3 was then digested with Nco I 
and Nsi I, and treated with klenow polymerase and T4 DNA polymerase, 
to produce a blunt ended 738 bp DNA fragment which was isolated and 

2 0 cloned into Asp I cut, klenow polymerase treated expression plasmid 

pQE30 (HIV). The resulting plasmid pBJ 1015 expresses HCV NS3 (246 
amino acids) protease. 

(ii) Construction of the plasmid pTS 56-9 with a stop codon after amino 

2 5 acid 183 (Figure 3) 

The plasmid pTHB HCV NS3 was digested with Nco I, treated with 
klenow polymerase, then digested with Bst Y I; and the DNA fragment 
containing HCV sequence was isolated and cloned into Sma I and Bgl II 

3 0 digested pSP72. The resulting plasmid pTS 49-27 was then digested with 

Bgl II and Hpa I and ligated with a double stranded oligonucleotide: 

GA TCA CCG GTC TAG ATCT 

t ggc gag atc tag a (SEQ ID NO 11) to produce pTS 56-9. 
Thus, a stop codon was placed directly at the end of DNA encoding the 
3 5 protease catalytic domain of the NS3 protein. This enabled the HCV 

protease to be expressed independently from the helicase domain of the 
NS3 protein. 
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(iii) Construction of the plasmid pJB 1006 Fused with a peptide of 
. positively charged amino aicids at the carboxy terminus of NS3 183 (Figure 
4). 

5 The plasmid pTS 56-9 was digested with Sph I and Bgl II and the DNA 
fragment containing HCV sequence was isolated and cloned into a Sph I, 
Bgl II cut pSP72. The resulting plasmid pJB 1002 digested with Age I and 
Hpal and ligated to a double stranded oligonucleotide, 

CCG GTC CGG AAG AAA AAG AGA CGC TAG C 
10 AG GCC TTC TTT TTC TCT GCG ATC G 

(SEQ ID NO 12), to construct pJB 1006. This fused the hydrophilic, 
solubilizing motif onto the NS3 protease. 



1 5 (iv) Construction of the plasmid pBJ 1022 expressing His-NS3(183)-HT 

in E.coli (Figure 5) 

The plasmid pJB 1006 was digested with NgoM I and Nhe I and the 216 bp 
DNA fragment was isolated and cloned into Ngo M I, Nhe I cut pBJ 1015 

2 0 to construct plasmid pBJ 1019. The plasmid pBJ 1019 was digested with Nar 

I and Pvu II, and treated with Klenow polymerase to fill in 5' ends of Nar 
I fragments. The expression plasmid pQE31 (Invitrogen) was digested with 
BamH I, blunt ended with Klenow polymerase. The 717 bp Nar I- Pvu II 
DNA fragment was isolated and ligated to the 2787 bp BamH I/Klenowed 

2 5 -Msc I (Bal I) fragment of the expression plasmid pQE31 (Invitrogen). The 

recombinant plasmid, pBJ 1022, obtained after transformation into E.coli 
expresses His NS3(2-183)-HT which does not contain any HIV protease 
cleavage site sequence. The plasmid also contains a large deletion in the 
CAT (Chloramphenicol Acetyl Transferase) gene. 

30 

(v) Construction of the plasmid pNB(-V)182-A4A HT (Figure 6) 

The plasmid pMBM 48 was digested with Eag I and Xho I, treated with 
Klenow polymerase and the 320 bp DNA fragment was isolated and cloned 

3 5 into BamH I cut , blunt ended pSP 72 to construct the plasmid pJB1004. 

The 320 bp fragment encodes 7 amino acid from carboxy terminal of 
NS3(631), all of NS4A, and the amino terminal 46 amino acid of NS4B. 
The recombinant plasmid pJB1004 was digested with Eag I and Cel 2, blunt 
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ended with Klenow polymerase. The 220 bp DNA fragment was isolated 
and cloned into the expression plasmid pQE30 which was digested with 
BamH I and blunt ended with Klenow polymerase prior to ligation. The 
resulting plasmid pJB 1011 was digested with NgoM I and Hind HI and 
5 ligated to a double stranded oligonucleotide , 

CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC CAG GAA TTC 
GT TAA TAT GGA CTG TCC CTC CAA GAG ATG GTC CTT AAG 

1 0 GAT GAG ATG GAA GAG TGC CGG AAG AAA AAG AGA CGC A 

CTA CTC TAC CTT CTC ACG GCC TTC TTT TTC TCT GCG TTC GA 

(SEQ ID NO 13) 

to construct the plasmid pNB 4A HT. The plasmid pNB 4AHT was 

1 5 digested with Msl I and Xba I. The 1218 bp DNA fragment was isolated and 

cloned into Age I cut, klenow polymerase treated, Xba I cut vector DNA of 
pBJ 1019. The ligation results in a substitution of the 183rd amino acid 
residue valine by a glycine residue in NS3, and a deletion of amino 
terminal three amino acid residues of NS4A at the junction. The 

2 0 recombinant plasmid pNB182A4A HT comprising NS3(182aa)-G-NS4A(4- 

54 amino acid) does not contain NS3/NS4A cleavage site sequence at the 
junction and is not cleaved by the autocatalytic activity of NS3. Finally the 
plasmid pNB182A4A HT (SEQ ID NO 8) was digested with Stu I and Nhe I, 
the 803 bp DNA fragment was isolated and cloned into Stu I and Nhe I cut 

2 5 plasmid pBJ 1022. The resulting plasmid pNB(-V)182-A4A HT contains a 

deletion of the HIV sequence from the amino terminus end of the NS3 
sequence and in the CAT gene (SEQ ID NO 27). 

(vi) Construction of the plasmid pT5 His HIV-NS3 (Figure 7) 

30 

The plasmid pTS56-9 was digested with Bgl n, and treated with 
Klenow polymerase to fill in 5' ends. The plasmid was then digested with 
NgoM I and the blunt ended Bgl II/NgoMI fragment containing the NS3 
sequence was isolated and ligated to the Sgll, Klenow treated NgmNfl cut 

3 5 and Sal I klenowed pBJ 1015. The resulting plasmid is designated pTSHis 

HIV 183. 
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Example 4 

Purification of HCV NS3 Protp ase having a Solubilizing Motif 

5 Purification o f Hislfi2HT (SFO ID NO 4) and 

His (-V)1B2A4AHT fSBO ID NO 8) 

The recombinant plasmids pBJ1022 and pNB(-V)182A4A were used 
to transform separate cultures of E. coli strain M15 [pREP4] (Qiagen), which 
1 0 over-expresses the lac repressor, according to methods recommended by 
the manufacturer. M15 [pREP4] bacteria harboring recombinant plasmids 
were grown overnight in broth containing 20g/L bactotrypton, 5g/L bacto- 
yeast extract, lOg/L NaCl and supplemented with 100|ig/ml ampicillin and 
25M.g/ml kanamycin. Cultures were diluted down to O.D.600 of 0.1, then 

1 5 grown at 30°C to O.D.600 of 0.6 to 0.8, after which IPTG was added to a final 

concentration of ImM. At post-induction 2 to 3 hours, the cells were 
harvested by pelleting, and the cell pellets were washed with lOOmM Tris, 
pH 7.5. Cell lysates were prepared as follows: to each ml equivalent of 
pelleted fermentation broth was added 50^il sonication buffer (50 mM 

2 0 sodium phosphate, pH 7.8, 0.3M NaCl) with 1 mg/ml lysozyme; cell 

suspension was placed on ice for 30 min. Suspension was then brought to 
a final concentration of 0.2% Tween-20, lOmM dithiothreitol (DTT), and 
sonicated until cell breakage was complete. Insoluble material was pelleted 
at 12,000 x g in a microcentrifuge for 15 minutes, the soluble portion was 

2 5 removed to a separate tube and the soluble lysate was then brought to a 

final concentration of 10% glycerol. Soluble lysates from cells expressing 
the plasmids produce strongly immunoreactive bands of the predicted 
molecular weight. Soluble lysates prepared for Ni 2+ column purification 
were prepared with lOmM P-mercaptoethanol (BME) instead of DTT. 

3 0 Lysates were stored at -80°C . 

Purification using Ni2±- Nitros v l ac e tic acid (NTA) pgarore fPIAGEN) 

The proteins were then purified by placing the extracted lysate on an 
3 5 NTA agarose column. NTA agarose column chromatography was used 
because the histidine tag which was fused to the N-terminus of the 
proteases readily binds to the nickel column. This produces a powerful 
affinity chromatographic technique for rapidly purifying the soluble 
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protease. The column chromatography was performed in a batch mode. 
The Ni 2+ NTA resin (3ml) was washed twice with 50 ml of Buffer A ( 
50mM sodium phosphate pH 7.8 containing 10% glycerol, 0.2% Tween-20, 
lOmM BME). The lysate obtained from a 250 ml fermentation (12.5 ml) 
5 was incubated with the resin for one hour at 4°C. The flow through was 
collected by centrifugation. The resin was packed into a 1.0 x 4 cm column 
and washed with buffer A until the baseline was reached. The bound 
protein was then eluted with a 20 ml gradient of imidazole (0-0.5M) in 
buffer A. Eluted fractions were evaluated by SDS-PAGE and western blot 
1 0 analysis using a rabbit polyclonal antibody to His-HTV 183. 

Purification using POROS metal-chelate affin ity column 

In an alternative method to purify the proteins the lysate containing the 

1 5 proteins were applied to a POROS metal-chelate affinity column. 

Perfusion chromatography was performed on a POROS MC metal chelate 
column (4.6 x 50mm, 1.7 ml) precharged with Ni 2+ . The sample was 
applied at 10 ml/min and the column was washed with buffer A. The 
column was step eluted with ten column volumes of buffer A containing 

2 0 25 mM imidazole. The column was further eluted with a 25 column 

volume gradient of 25-250 mM imidazole in buffer A. All eluted fractions 
were evaluated by SDS-PAGE and western blot analysis using rabbit 
polyclonal antibody. 

25 

Example 5 

Peptide Synthesis of the 5A/5B and 4B/5A Substrates 

30 

The peptides 5A/5B and 4B/5A substrates (SEQ ID NOs 16, 18, 19, 20 and 
21) were synthesized using Fmoc chemistry on an ABI model 431A peptide 
synthesizer. The manufacture recommended FastMoc™ activation 
strategy (HBTU/HOBt) was used for the synthesis of 4A activator peptide. 

3 5 A more powerful activator, HATU with or without the additive HO At 

were employed to assemble 5A/5B substrate peptides on a preloaded Wang 
resin. The peptides were cleaved off the resin and deprotected by standard 
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TFA cleavage protocol. The peptides were purified on reverse phase HPLC 
and confirmed by mass spectrometry analysis. 

Example 6 

5 

HPLC-assay using a synthetic 5A/5B peptide substrate 

To test the proteolytic activity of the HCV NS3 protease the 
DTEDWCC SMSYTWTGK (SEQ ID NO 16) and soluble HCV NS3 (SEQ ID 
1 0 NO 27) were placed together in an assay buffer. The assay buffer was 

50mM sodium phosphate pH 7.8, containing 15% glycerol, lOmM DTT, 
0.2% Tween20 and 200 mM NaCl). The protease activity of SEQ ID NO 27 
cleaved the substrate into two byproduct peptides, namely 5 A and 5B. The 
substrate and two byproduct peptides were separated on a reversed-phase 

1 5 HPLC column. (Dynamax, 4.6 x 250 mm) with a pore size of 300A and a 

particle size of 5|im. The column was equilibrated with 0,1%TFA (Solvent 
A) at a flow rate of 1 ml per minute. The substrate and the product peptide 
standards were applied to the column equilibrated in A. Elution was 
performed with a acetonitrile gradient (Solvent B=100% acetonitrile in A). 

2 0 Two gradients were used for elution (5% to 70%B in 50 minutes followed 

by 70% to 100%B in 10 minutes). 

In another experiment, partially purified SEQ ID NO 27 or vector 
control was incubated with lOOpM of substrate for 3, 7 and 24 hours at 
2 5 30°C. The reaction mixture was quenched by the addition of TFA to 0.01% 
and applied to the reversed-phase HPLC column. The fractions from each 
run were evaluated by mass spectrometry and sequencing. 



3 0 Example 7 

Rpfolding of Insoluble HCV NS3 Protease 

The present example describes a novel process for the refolding of 
3 5 HCV NS3 protease which does not have a solubilizing motif from an 
E.coli inclusion body pellet. This procedure can be used to generate 
purified enzyme for activity assays and structural studies. 
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Extraction and Purification of His-H IV 183 from the E.coli inclusion body 
pellet 

5 E. coli cells harboring the plasmid for HisHF/183 was used to 

transform a culture of E, coli strain M15 [pREP] (Qiagen), which over- 
expresses the lac repressor, according to methods recommended by 
commercial source. M15 [pREP] bacteria harboring recombinant plasmids 
were grown overnight in 20-10-5 broth supplemented with lOOjig/ml 
1 0 ampicillin and 25|ig/ml kanamycin. Cultures were diluted to O.D.600 of 

0. 1. then grown at 37°C to O.D.600 of 0.6 to 0.8, after which IPTG was added 
to a final concentration of ImM. At post-induction 2 to 3 hours, the cells 
were harvested by pelleting, and the cell pellets were washed with lOOmM 
Tris, pH 7.5. were pelleted by centrifugation. The cell pellet was 

1 5 resuspended in 10 ml of 0.1M Tris-HCl, 5mM EDTA, pH 8.0 (Buffer A) for 

each gm wet weight of pellet. The pellet was homogenized and 
resuspended using a Dounce homogenizer. The suspension was clarified 
by centrifugation at 20,000 x g for 30 minutes at 4°C. The pellet was 
sequentially washed with the following five buffers: 

20 

1. Buffer A 

2. 1.0M sodium chloride (NaCl) in buffer A 

3. 1.0% Triton X-100 in buffer A 

2 5 4. Buffer A 

5. 1.0 M Guanidine HC1 ( GuHCl) in buffer A. 

The washed pellet was solubilized with 5M GuHCl, 1% beta 
mercaptoethanol in buffer A (3 ml per gm wet wt. of pellet) 

3 0 using a Dounce homogenizer and centrifuged at 100,000 x g for 30 minutes 

at 4°C. Purification of denatured HisHIV183 from high molecular weight 
aggregates was accomplished by size exclusion on a SEPHACRYL S-300 gel 
filtration column. 

3 5 In particular, an 8 ml sample of the 5.0M GuHCl E. coli extract was 

applied to a 160 ml Pharmacia S-300 column (1.6 x 100 cm) at a flow rate of 
1.0 ml/min. The column buffer was comprised of 5.0 M GuHCl, 0.1 M 
Tris-HCl, pH 8.0, and 5.0 mM EDTA. The fraction size was 5.0 ml. 
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Appropriate fractions were pooled based on the results of SDS-PAGE, as 
well as N-terminal sequence analysis of the protein transferred to a Pro- 
Blot. 

5 petergent-a ssisted refolding of HCV-PrOtease 

The protein was concentrated by ultrafiltration using a 43 mm 
Amicon YM10 membrane to 1.0 mg per ml in 5M GuHCl, 0.1M Tris-HCl 
pH 8.0, 1.0 mM EDTA, 1.0% beta-mercaptoethanol. It was then diluted 50- 
1 0 fold to 0.1M GuHCl in refolding buffer (100 mM sodium phosphate pH 8.0, 
lOmM DTT, 0.1% lauryl maltoside) and the mixture was incubated on ice 
for at least one hour. A 25 ml sample containing 500 ug of the protein in 
the refolding buffer was applied to a Pro-RPC HR 3/5 reversed phase 
chromatography column. The applied sample contained 500 ug protein in 

1 5 25 ml of refolding buffer. To the column was then applied a solution B 

comprised of 99.9% H2O + 0.1% trifluoroacetic acid (TFA). A 10 ml 
volume of solution C [10% H 2 G\ 90% acetonitrile (AcN) + 0.1% TFA] was 
applied to the column at a 0 - 60% gradient into solution B at a flow rate of 
0.5ml/min. and a fraction size of 0.5ml. The fractions were monitored at 

2 0 A214; 2.0 absorbance units full scale (AUFS). 

Fractions containing the protein (corresponding to peak 1) were 
pooled for renaturation by stepwise dialysis. The fractions were first 
dialysed in 0.1% TFA in 25% glycerol overnight at 4°C ; then dialyzed in 

2 5 0.01% TFA in 25% glycerol overnight at 4°C; then dialyzed in 0.001% TFA 

in 25% glycerol for 3.0 hours; then dialyzed for 3 hours at 4°C in 50 mM 
NaP0 4 , pH 6.0, 10 mM dithiotreitol (DTT) in 25% glycerol. The protein was 
then dialyzed for 3.0 hours at 4°C in 50 mM NaP0 4 , pH 7.0, 0.15 M NaCl, 10 
mM DTT in 25% glycerol; and then finally dialyzed in 50 mM NaPC>4, pH 

3 0 7.8, 0.3 M NaCl, 10 mM DTT, 0.2% Tween 20 in 25% glycerol. This resulted 

in purified, refolded, soluble, active HCV NS3 protease. 

Far UV circular dichroism (CD) analysis of the protein was used to 
monitor the refolding from an acid denatured state to a folded state at 
3 5 neutral pH. The protein recovery was monitored by a UV scan and SDS- 
PAGE analysis. 

Results: 
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Detergent-assisted Rp folding of His-HIV183 

HisHIV183 was quantitatively extracted from an E. coli inclusion 
5 body pellet. SDS-PAGE analysis at the various stages of extraction shows 
that sequential washes are essential to remove significant amounts of the 
contaminating proteins. HisHIV183 was extracted from the washed 
inclusion body pellet in the presence of 5M GuHCl. The 5M GuHCl extract 
was applied to a SEPHACRYL S-300 column and the appropriate fractions 
1 0 were pooled based on SDS-PAGE analysis. The amino acid sequence of the 
first ten residues was verified. 

Refolding was performed at very low concentrations of protein, in 
the presence of DTT, lauryl maltoside and glycerol at 4°C. The diluted 

1 5 protein was concentrated on a Pro-RPC reversed phase column. Two peaks 

were obtained based on the UV and protein profile. Only Peak 1 has 
yielded soluble protein after stepwise dialysis. Far UV CD spectral analysis 
was used to monitor refolding from a denatured state at acid pH to a 
folded state at neutral pH. At pH 7.4, the protein was found to exhibit 

2 0 significant amounts of secondary structure that is consistent with that of 

beta sheet protein. At low pH, the CD spectrum showed that it is fully 
random coil, having a minimal molar ellipticity at 200nm. The ratio of 
this minimum at 200nm to that of the shoulder at 220 nm is 
approximately 4:1. This ratio decreased when the secondary structure 

2 5 formation occurred at neutral pH. 

A UV scan at each step of dialysis showed that the protein recovery 
was >90% up to pH 7.4 and that there was no light scattering effect due to 
protein aggregates. SDS-PAGE analysis also indicated that there was no 

3 0 loss of protein up to pH 7.0 during refolding. Precipitation of protein 

occurred at the last step of dialysis, and the soluble protein was clarified by 
centrifugation. The overall protein recovery was about 0.10%. The 
refolded protein was found to be active in a trans-cleavage assay using the 
in zntro-translated 5A/5B substrate in the presence of 4A peptide. 

35 
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SEQUENCE LISTING 



5 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Schering Corporation 

10 

(ii) TITLE OF INVENTION: Surface Plasmon Resonance Based 
Enzymatic Assay 

(iii) NUMBER OF SEQUENCES: 35 

1 5 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Schering Corp. 

(B) STREET: 2000 Galloping Hill Road 

(C) CITY: Kenilworth 

2 0 P) STATE: New Jersey 

(E) COUNTRY: USA 

(F) ZIP: 07033-0530 

(v) COMPUTER READABLE FORM: 

2 5 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: Apple Macintosh 

(C) OPERATING SYSTEM: Macintosh 7.1 

(D) SOFTWARE: Microsoft Word 5.1a 

3 0 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

3 5 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/440,283 

(B) FILING DATE: 12-MAY-1995 
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(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Lunn, Paul G. 

(B) REGISTRATION NUMBER: 32,743 

(C) REFERENCE/DOCKET NUMBER: JB0513 

5 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 908-298-5061 

(B) TELEFAX: 908-298-5388 

1 0 (2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs 

(B) TYPE: nucleic acid 

1 5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

2 0 (ix) FEATURE: 

(A) NAME /KEY: HCV NS3 Protease 



GCG CCC ATC ACG 
2 5 Ala Pro lie Thr 
1 

TGT ATA ATC ACC 
Cys lie He Thr 

30 

GGT GAG GTC CAG 
Gly Glu Val Gin 

35 

ACG TGC ATC AAT 
Thr Cys He Asn 



GCG TAC GCC CAG CAG ACG 
Ala Tyr Ala Gin Gin Thr 
5 10 

AGC CTG ACT GGC CGG GAC 
Ser Leu Thr Gly Arg Asp 
20 25 

ATC GTG TCA ACT GCT ACC 
He Val Ser Thr Ala Thr 
35 40 

GGG GTA TGC TGG ACT GTC 
Gly Val Cys Trp Thr Val 
50 55 



AGA GGC CTC CTA GGG 45 
Arg Gly Leu Leu Gly 
15 

AAA AAC CAA GTG GAG 9 0 
Lys Asn Gin Val Glu 
30 

CAA ACC TTC CTG GCA 13 5 
Gin Thr Phe Leu Ala 
4 5 

TAC CAC GGG GCC GGA 180 
Tyr His Gly Ala Gly 
60 
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5 ACG AGO ACC ATC GCA TCA CCC AAG GGT CCT GTC ATC CAG ATG TAT 225 
Thr Arg Thr lie Ala Ser Pro Lys Gly Pro Val lie Gin Met Tyr 
65 70 75 

ACC AAT GTG GAC CAA GAC CTT GTG GGC TGG CCC GCT CCT CAA GGT 270 

1 0 Thr Asn Val Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly 

80 85 90 

TCC CGC TCA TTG ACA CCC TGC ACC TGC GGC TCC TCG GAC CTT TAC 315 
Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr 
15 95 100 105 

CTG GTT ACG AGG CAC GCC GAC GTC ATT CCC GTG CGC CGG CGA GGT 3 60 

Leu Val Thr Arg His Ala Asp Val lie Pro Val Arg Arg Arg Gly 

110 115 120 

20 

GAT AGC AGG GGT AGC CTG CTT TCG CCC CGG CCC ATT TCC TAC CTA 40 5 

Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro lie Ser Tyr Leu 

125 130 135 

2 5 AAA GGC TCC TCG GGG GGT CCG CTG TTG TGC CCC GCG GGA CAC GCC 450 

Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala 
140 145 150 

GTG GGC CTA TTC AGG GCC GCG GTG TGC ACC CGT GGA GTG ACC AAG 49 5 

3 0 Val Gly Leu Phe Arg Ala Ala Val Cys Thr Arg Gly Val Thr Lys 

155 160 165 

GCG GTG GAC TTT ATC CCT GTG GAG AAC CTA GAG ACA ACC ATG AGA 54 0 
Ala Val Asp Phe lie Pro Val Glu Asn Leu Glu Thr Thr Met Arg 
3 5 170 175 180 

TCC CCG GTG 
Ser Pro Val 
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(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

1 0 (ii) MOLECULE TYPE: peptide 



Arg Lys Lys Lys Arg Arg 

(2) INFORMATION FOR SEQ ID NO:3: 

1 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

2 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 
2 5 (A) NAME/KEY: 



30 



35 



GCG CCC ATC ACG GCG TAC GCC CAG CAG ACG AGA GGC CTC CTA GGG 45 
Ala Pro lie Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly 
15 10 15 

TGT ATA ATC ACC AGC CTG ACT GGC CGG GAC AAA AAC CAA GTG GAG 90 
Cys lie lie Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu 
20 25 30 



GGT GAG GTC CAG ATC GTG TCA ACT GCT ACC CAA ACC TTC CTG GCA 13 5 
Gly Glu Val Gin lie Val Ser Thr Ala Thr Gin Thr Phe Leu Ala 
35 40 45 
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■ ACG TGC ATC AAT GGG GTA TGC TGG ACT GTC TAC CAC GGG GCC GGA 180 
Thr Cys lie Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly 
50 55 60 

5 

ACG AGG ACC ATC GCA TCA CCC AAG GGT CCT GTC ATC CAG ATG TAT 22 5 
Thr Arg Thr He Ala Ser Pro Lys Gly Pro Val He Gin Met Tyr 
65 70 75 

1 0 ACC AAT GTG GAC CAA GAC CTT GTG GGC TGG CCC GCT CCT CAA GGT 27 0 
Thr Asn Val Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly 
80 85 90 

TCC CGC TCA TTG ACA CCC TGC ACC TGC GGC TCC TCG GAC CTT TAC 315 
1 5 Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr 

95 100 105 

CTG GTT ACG AGG CAC GCC GAC GTC ATT CCC GTG CGC CGG CGA GGT 3 60 
Leu Val Thr Arg His Ala Asp Val He Pro Val Arg Arg Arg Gly 
20 110 115 120 

GAT AGC AGG GGT AGC CTG CTT TCG CCC CGG CCC ATT TCC TAC CTA 4 05 

Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro He Ser Tyr Leu 

125 130 135 

25 

AAA GGC TCC TCG GGG GGT CCG CTG TTG TGC CCC GCG GGA CAC GCC 45 0 

Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala 

140 145 150 

3 0 GTG GGC CTA TTC AGG GCC GCG GTG TGC ACC CGT GGA GTG ACC AAG 49 5 
Val Gly Leu Phe Arg Ala Ala Val Cys Thr Arg Gly Val Thr Lys 
155 160 165 

GCG GTG GAC TTT ATC CCT GTG GAG AAC CTA GAG ACA ACC ATG AGA 54 0 
3 5 Ala Val Asp Phe He Pro Val Glu Asn Leu Glu Thr Thr Met Arg 

170 175 180 

TCC CCG GTG AGA AAG AAG AAG AGA AGA 
Ser Pro Val Arg Lys Lys Lys Arg Arg 



:WO 9635806 A 1 I > 



WO 96/35806 



PCT/US96/06385 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 603 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

1 0 (ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: pBJ1022(His/NS3 (182)/H.T. 

15 

ATG AGA GGA TCG CAT CAC CAT CAC CAT CAC ACG GAT CCG CCC ATC 45 
Met Arg Gly Ser His His His His His His Thr Asp Pro Pro lie 
15 10 15 

2 0 ACG GCG TAC GCC CAG CAG ACG AGA GGC CTC CTA GGG TGT ATA ATC 9 0 

Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly Cys lie lie 
20 25 30 

ACC AGC CTG ACT GGC CGG GAC AAA AAC CAA GTG GAG GGT GAG GTC 13 5 

2 5 Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu Gly Glu Val 

35 40 45 

CAG ATC GTG TCA ACT GCT ACC CAA ACC TTC CTG GCA ACG TGC ATC 180 
Gin lie Val Ser Thr Ala Thr Gin Thr Phe Leu Ala Thr Cys He 

3 0 50 55 60 

AAT GGG GTA TGC TGG ACT GTC TAC CAC GGG GCC GGA ACG AGG ACC 225 

Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr 

65 70 75 

35 

ATC GCA TCA CCC AAG GGT CCT GTC ATC CAG ATG TAT ACC AAT GTG 27 0 

He Ala Ser Pro Lys Gly Pro Val He Gin Met Tyr Thr Asn Val 

80 85 90 
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GAC CAA GAC CTT GTG GGC TGG CCC GCT CCT CAA GGT TCC CGC TCA 315 

Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly Ser Arg Ser 

95 100 105 

5 

TTG ACA CCC TGC ACC TGC GGC TCC TCG GAC CTT TAC CTG GTT ACG 3 60 

Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr 

110 115 120 

1 0 AGG CAC GCC GAC GTC ATT CCC GTG CGC CGG CGA GGT GAT AGC AGG 405 
Arg His Ala Asp Val lie Pro Val Arg Arg Arg Gly Asp Ser Arg 
125 130 135 

GGT AGC CTG CTT TCG CCC CGG CCC ATT TCC TAC CTA AAA GGC TCC 450 
1 5 Gly Ser Leu Leu Ser Pro Arg Pro lie Ser Tyr Leu Lys Gly Ser 

140 145 150 

TCG GGG GGT CCG CTG TTG TGC CCC GCG GGA CAC GCC GTG GGC CTA 495 
Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu 
20 155 160 165 

TTC AGG GCC GCG GTG TGC ACC CGT GGA GTG ACC AAG GCG GTG GAC 54 0 

Phe Arg Ala Ala Val Cys Thr Arg Gly Val Thr Lys Ala Val Asp 
170 175 180 

25 

TTT ATC CCT GTG GAG AAC CTA GAG ACA ACC ATG AGA TCC CCG GTG 585 

Phe lie Pro Val Glu Asn Leu Glu Thr Thr Met Arg Ser Pro Val 
185 190 195 

3 0 AGA AAG AAG AAG AGA AGA 
Arg Lys Lys Lys Arg Arg 



3 5 (2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unear 

(ii) MOLECULE TYPE: cDNA 

5 

(ix) FEATURE: 

(A) NAME/KEY: pT5His/HTV/183 No solubilizing motif 

ATG AGA GGA TCG CAT CAC CAT CAC CAT CAC GGA TCC CAT AAG GCA 45 
1 0 Met Arg Gly Ser His His His His His His Gly Ser His Lys Ala 
15 10 15 

AGA GTT TTG GCT GAA GCA ATG AGC CAT GGT ACC ATG GCG CCC ATC 90 
Arg Val Leu Ala Glu Ala Met Ser His Gly Thr Met Ala Pro lie 

1 5 20 25 30 

ACG GCG TAC GCC CAG CAG ACG AGA GGC CTC CTA GGG TGT ATA ATC 13 5 

Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly Cys lie lie 

35 40 45 

20 

ACC AGC CTG ACT GGC CGG GAC AAA AAC CAA GTG GAG GGT GAG GTC 180 

Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu Gly Glu Val 

50 55 60 

2 5 CAG ATC GTG TCA ACT GCT ACC CAA ACC TTC CTG GCA ACG TGC ATC 225 

Gin lie Val Ser Thr Ala Thr Gin Thr Phe Leu Ala Thr Cys He 
65 70 75 

AAT GGG GTA TGC TGG ACT GTC TAC CAC GGG GCC GGA ACG AGG ACC 27 0 

3 0 Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr 

80 85 90 

ATC GCA TCA CCC AAG GGT CCT GTC ATC CAG ATG TAT ACC AAT GTG 315 
He Ala Ser Pro Lys Gly Pro Val He Gin Met Tyr Thr Asn Val 
3 5 95 100 105 
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GAC CAA GAC CTT GTG GGC TGG CCC GCT CCT CAA GGT TCC CGC TCA 3 60 

Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly Ser Arg Ser 

110 115 120 

5 TTG ACA CCC TGC ACC TGC GGC TCC TCG GAC CTT TAC CTG GTT ACG 405 

Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr 

125 130 135 

AGG CAC GCC GAC GTC ATT CCC GTG CGC CGG CGA GGT GAT AGC AGG 45 0 

1 0 Arg His Ala Asp Val lie Pro Val Arg Arg Arg Gly Asp Ser Arg 

140 145 150 

GGT AGC CTG CTT TCG CCC CGG CCC ATT TCC TAC CTA AAA GGC TCC 49 5 

Gly Ser Leu Leu Ser Pro Arg Pro lie Ser Tyr Leu Lys Gly Ser 

1 5 155 160 165 

TCG GGG GGT CCG CTG TTG TGC CCC GCG GGA CAC GCC GTG GGC CTA 54 0 

Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu 

170 175 180 

20 

TTC AGG GCC GCG GTG TGC ACC CGT GGA GTG ACC AAG GCG GTG GAC 585 

Phe Arg Ala Ala Val Cys Thr Arg Gly Val Thr Lys Ala Val Asp 

185 190 195 

2 5 TTT ATC CCT GTG GAG AAC CTA GAG ACA ACC ATG AGA TCC CCG GTG 63 0 

Phe He Pro Val Glu Asn Leu Glu Thr Thr Met Arg Ser Pro Val 

200 205 210 



30 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 162 base pairs 
3 5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME/KEY: NS4A 

5 AGC ACC TGG GTG CTC GTT GGC GGC GTC CTG GCT GCT CTG GCC GCG 45 
Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala 
15 10 15 

TAT TGC CTG TCA ACA GGC TGC GTG GTC ATA GTG GGC AGG ATT GTC 90 
1 0 Tyr Cys Leu Ser Thr Gly Cys Val Val lie Val Gly Arg He Val 

20 25 30 

TTG TCC GGG AAG CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC 13 5 
Leu Ser Gly Lys Pro Ala He He Pro Asp Arg Glu Val Leu Tyr 
1 5 35 40 45 

CAG GAG TTC GAT GAG ATG GAA GAG TGC 162 
Gin Glu Phe Asp Glu Met Glu Glu Cys 
50 



20 



30 



(2) INFORMATION FOR SEQ ID NO:7: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 702 base pairs 
2 5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY: NS3 +NS4A 



3 5 GCG CCC ATC ACG GCG 
Ala Pro lie Thr Ala 
1 5 



TAC GCC CAG CAG ACG 
Tyr Ala Gin Gin Thr 
10 



AGA GGC CTC CTA GGG 45 
Arg Gly Leu Leu Gly 
15 



<WO 9 635806 A 1 _l_> 



37 

WO 96/35806 PCT/US96/06385 

TGT ATA ATC ACC AGC CTG ACT GGC CGG GAC AAA AAC CAA GTG GAG 9 0 

Cys lie lie Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu 
20 25 30 

5 GGT GAG GTC CAG ATC GTG TCA ACT GCT ACC CAA ACC TTC CTG GCA 13 5 

Gly Glu Val Gin He Val Ser Thr Ala Thr Gin Thr Phe Leu Ala 
35 40 45 

ACG TGC ATC AAT GGG GTA TGC TGG ACT GTC TAC CAC GGG GCC GGA 18 0 

1 0 Thr Cys He Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly 

50 55 60 



ACG AGG ACC ATC GCA TCA CCC AAG GGT CCT GTC ATC CAG ATG TAT 22 5 
Thr Arg Thr He Ala Ser Pro Lys Gly Pro Val lie Gin Met Tyr 
1 5 65 70 75 



ACC AAT GTG GAC CAA GAC CTT GTG GGC TGG CCC GCT CCT CAA GGT 27 0 

Thr Asn Val Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly 

80^ 85 90 

20 

TCC CGC TCA TTG ACA CCC TGC ACC TGC GGC TCC TCG GAC CTT TAC 315 

Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr 

95 100 105 

2 5 CTG GTT ACG AGG CAC GCC GAC GTC ATT CCC GTG CGC CGG CGA GGT 3 60 

Leu Val Thr Arg His Ala Asp Val He Pro Val Arg Arg Arg Gly 

110 115 120 

GAT AGC AGG GGT AGC CTG CTT TCG CCC CGG CCC ATT TCC TAC CTA 4 05 

3 0 Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro He Ser Tyr Leu 

125 130 135 



AAA GGC TCC TCG GGG GGT CCG CTG TTG TGC CCC GCG GGA CAC GCC 450 
3 5 Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala 

140 145 isQ 



3NSDOCID: <WO 9635806A1J_> 



3 8 

WO 96/35806 PCT7US9 6/0638 5 

GTG GGC CTA TTC AGG GCC GCG GTG TGC ACC CGT GGA GTG ACC AAG 49 5 

Val Gly Leu Phe Arg Ala Ala Val Cys Thr Arg Gly Val Thr Lys 
155 160 165 



5 GCG GTG GAC TTT ATC 
Ala Val Asp Phe He 
170 



CCT GTG GAG AAC CTA 
Pro Val Glu Asn Leu 
175 



GAG AC A ACC ATG AGA 54 0 
Glu Thr Thr Met Arg 
180 



TCC CCG GGG GTG CTC 
1 0 Ser Pro Gly Val Leu 

185 



GTT GGC GGC GTC CTG 
Val Gly Gly Val Leu 
190 



GCT GCT CTG GCC GCG 585 
Ala Ala Leu Ala Ala 
195 



TAT TGC CTG TCA ACA GGC TGC GTG GTC ATA GTG GGC AGG ATT GTC 63 0 
Tyr Cys Leu Ser Thr Gly Cys Val Val He Val Gly Arg He Val 
1 5 200 205 210 

TTG TCC GGG AAG CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC 67 5 
Leu Ser Gly Lys Pro Ala He lie Pro Asp Arg Glu Val Leu Tyr 
215 220 225 

20 

CAG GAG TTC GAT GAG ATG GAA GAG TGC 7 02 
Gin Glu Phe Asp Glu Met Glu Glu Cys 
230 



25 



35 



(2) INFORMATION FOR SEQ ID NO:8: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 855 base pairs 
3 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: pNB182A4AHT 
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ATG AGA GGA TCG CAT CAC CAT CAC CAT CAC GGA TCC CAT AAG GCA 45 
Met Arg Gly Ser His His His His His His Gly Ser His Lys Ala 
15 10 15 

5 AGA GTT TTG GCT GAA GCA ATG AGC CAT GGT ACC ATG GCG CCC ATC 90 
Arg Val Leu Ala Glu Ala Met Ser His Gly Thr Met Ala Pro lie 
20 25 30 

ACG GCG TAC GCC CAG CAG ACG AGA GGC CTC CTA GGG TGT ATA ATC 13 5 
1 0 Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly Cys lie lie 

35 40 45 

ACC AGC CTG ACT GGC CGG GAC AAA AAC CAA GTG GAG GGT GAG GTC 180 
Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu Gly Glu Val 
1 5 50 55 60 

CAG ATC GTG TCA ACT GCT ACC CAA ACC TTC CTG GCA ACG TGC ATC 225 
Gin lie Val Ser Thr Ala Thr Gin Thr Phe Leu Ala Thr Cys He 
65 70 75 



20 



25 



AAT GGG GTA TGC TGG ACT GTC TAC CAC GGG GCC GGA ACG AGG ACC 27 0 
Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr 
80 85 90 



ATC GCA TCA CCC AAG GGT CCT GTC ATC CAG ATG TAT ACC AAT GTG 315 
He Ala Ser Pro Lys Gly Pro Val He Gin Met Tyr Thr Asn Val 
95 100 105 

3 0 GAC CAA GAC CTT GTG GGC TGG CCC GCT CCT CAA GGT TCC CGC TCA 3 60 
Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly Ser Arg Ser 
HO 115 120 



3 5 TTG ACA CCC TGC ACC TGC GGC TCC TCG GAC CTT TAC CTG GTT ACG 40 5 
Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr 
125 130 135 
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AGG CAC GCC GAC GTC ATT CCC GTG CGC CGG CGA GGT GAT AGC AGG 45 0 
Arg His Ala Asp Val He Pro Val Arg Arg Arg Gly Asp Ser Arg 
140 145 150 

5 GGT AGC CTG CTT TCG CCC CGG CCC ATT TCC TAC CTA AAA GGC TCC 495 
Gly Ser Leu Leu Ser Pro Arg Pro He Ser Tyr Leu Lys Gly Ser 
155 160 165 

TCG GGG GGT CCG CTG TTG TGC CCC GCG GGA CAC GCC GTG GGC CTA 540 
1 0 Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu 

170 175 180 

TTC AGG GCC GCG GTG TGC ACC CGT GGA GTG ACC AAG GCG GTG GAC 585 
Phe Arg Ala Ala Val Cys Thr Arg Gly Val Thr Lys Ala Val Asp 

1 5 185 190 195 

TTT ATC CCT GTG GAG AAC CTA GAG AC A ACC ATG AGA TCC CCG GGG 63 0 

Phe He Pro Val Glu Asn Leu Glu Thr Thr Met Arg Ser Pro Gly 

20.0 205 210 

20 

GTG CTC GTT GGC GGC GTC CTG GCT GCT CTG GCC GCG TAT TGC CTG 720 

Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu 

215 220 225 

2 5 TCA AC A GGC TGC GTG GTC ATA GTG GGC AGG ATT GTC TTG TCC GGG 7 65 

Ser Thr Gly Cys Val Val He Val Gly Arg He Val Leu Ser Gly 
230 235 240 

AAG CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC CAG GAG TTC 810 

3 0 Lys Pro Ala He He Pro Asp Arg Glu Val Leu Tyr Gin Glu Phe 

245 250 255 

GAT GAG ATG GAA GAG TGC CGG AAG AAA AAG AGA CGC AAG CTT AAT 855 
Asp Glu Met Glu Glu Cys Arg Lys Lys Lys Arg Arg Lys Leu Asn 
3 5 260 265 270 
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(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 711 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



10 

(ix) FEATURE: 
(A) NAME /KEY: 

GCG CCC ATC ACG GCG TAC GCC CAG CAG ACG AGA GGC CTC CTA GGG 45 
Ala Pro lie Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly 
15 1 5 10 15 

TGT ATA ATC ACC AGC CTG ACT GGC CGG GAC AAA AAC CAA GTG GAG 9 0 

Cys lie He Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu 
20 25 30 

20 

GGT GAG GTC CAG ATC GTG TCA ACT GCT ACC CAA ACC TTC CTG GCA 13 5 

Gly Glu Val Gin He Val Ser Thr Ala Thr Gin Thr Phe Leu Ala 
35 40 45 

2 5 ACG TGC ATC AAT GGG GTA TGC TGG ACT GTC TAC CAC GGG GCC GGA 180 

Thr Cys He Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly 
50 55 60 

ACG AGG ACC ATC GCA TCA CCC AAG GGT CCT GTC ATC CAG ATG TAT 225 

3 0 Thr Arg Thr He Ala Ser Pro Lys Gly Pro Val He Gin Met Tyr 

65 70 75 

ACC AAT GTG GAC CAA GAC CTT GTG GGC TGG CCC GCT CCT CAA GGT 27 0 
Thr Asn Val Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly 
3 5 80 85 90 

TCC CGC TCA TTG ACA CCC TGC ACC TGC GGC TCC TCG GAC CTT TAC 315 
Ser Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr 
95 100 105 
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CTG GTT ACG AGG CAC GCC GAC GTC ATT CCC GTG CGC CGG CGA GGT 3 60 
Leu Val Thr Arg His Ala Asp Val He Pro Val Arg Arg Arg Gly 
110 115 120 

5 

GAT AGC AGG GGT AGC CTG CTT TCG CCC CGG CCC ATT TCC TAC CTA 405 
Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro He Ser Tyr Leu 
125 130 135 

1 0 AAA GGC TCC TCG GGG GGT CCG CTG TTG TGC CCC GCG GGA CAC GCC 450 
Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala 
140 145 150 

GTG GGC CTA TTC AGG GCC GCG GTG TGC ACC CGT GGA GTG ACC AAG 49 5 
1 5 Val Gly Leu Phe Arg Ala Ala Val Cys Thr Arg Gly Val Thr Lys 

155 160 165 

GCG GTG GAC TTT ATC CCT GTG GAG AAC CTA GAG AC A ACC ATG AGA 540 
Ala Val Asp Phe He Pro Val Glu Asn Leu Glu Thr Thr Met Arg 
20 170 175 180 

TCC CCG GGG GTG CTC GTT GGC GGC GTC CTG GCT GCT CTG GCC GCG 58 5 
Ser Pro Gly Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala 
185 190 195 

25 

TAT TGC CTG TCA AC A GGC TGC GTG GTC ATA GTG GGC AGG ATT GTC 63 0 
Tyr Cys Leu Ser Thr Gly Cys Val Val He Val Gly Arg He Val 
200 205 210 

3 0 TTG TCC GGG AAG CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC 67 5 
Leu Ser Gly Lys Pro Ala He He Pro Asp Arg Glu Val Leu Tyr 
215 220 225 

CAG GAG TTC GAT GAG ATG GAA GAG AAG GAG AC A GAG 
3 5 Gin Glu Phe Asp Glu Met Glu Glu Lys Glu Thr Glu 

230 

(2) INFORMATION FOR SEQ ID NO:10: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 855 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNTESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

1 0 (ix) FEATURE: 

(A) NAME/KEY: 



ATG AGA GGA TCG CAT CAC CAT CAC CAT CAC ACG GAT CCG GCG CCC 
Met Arg Gly Ser His His His His His His Thr Asp Pro Ala Pro 
1 5 1 5 10 15 



20 



ATC ACG GCG TAC GCC CAG CAG ACG AGA GGC CTC CTA GGG TGT ATA 45 

lie Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly Cys lie 

20 25 30 

ATC ACC AGC CTG ACT GGC CGG GAC AAA AAC CAA GTG GAG GGT GAG 9 0 

lie Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu Gly Glu 

35 40 45 

2 5 GTC CAG ATC GTG TCA ACT GCT ACC CAA ACC TTC CTG GCA ACG TGC 13 5 

Val Gin lie Val Ser Thr Ala Thr Gin Thr Phe Leu Ala Thr Cys 

50 55 60 



ATC AAT GGG GTA TGC TGG ACT GTC TAC CAC GGG GCC GGA ACG AGG 180 
3 0 lie Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg 

65 70 75 



ACC ATC GCA TCA CCC AAG GGT CCT GTC ATC CAG ATG TAT ACC AAT 225 
Thr lie Ala Ser Pro Lys Gly Pro Val lie Gin Met Tyr Thr Asn 
3 5 80 85 90 
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GTG GAC CAA 
Val Asp Gin 



5 TCA TTG ACA 
Ser Leu Thr 



GAC CTT GTG 
Asp Leu Val 
95 

CCC TGC ACC 
Pro Cys Thr 
110 



44 

GGC TGG CCC GCT 
Gly Trp Pro Ala 
100 

TGC GGC TCC TCG 
Cys Gly Ser Ser 
115 



i 

CCT CAA GGT TCC 
Pro Gin Gly Ser 

GAC CTT TAC CTG 
Asp Leu Tyr Leu 
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CGC 27 0 

Arg 

105 

GTT 315 

Val 

120 



ACG AGG CAC GCC GAC GTC ATT 
1 0 Thr Arg His Ala Asp Val lie 

125 

AGG GGT AGC CTG CTT TCG CCC 
Arg Gly Ser Leu Leu Ser Pro 

1 5 140 

TCC TCG GGG GGT CCG CTG TTG 
Ser Ser Gly Gly Pro Leu Leu 

2 0 CTA TTC AGG GCC GCG GTG TGC 

Leu Phe Arg Ala Ala Val Cys 
170 

GAC TTT ATC CCT GTG GAG AAC 
2 5 Asp Phe lie Pro Val Glu Asn 

185 



CCC GTG CGC CGG CGA GGT GAT AGC 3 60 
Pro Val Arg Arg Arg Gly Asp Ser 
130 135 

CGG CCC ATT TCC TAC CTA AAA GGC 40 5 
Arg Pro lie Ser Tyr Leu Lys Gly 
145 150 

TGC CCC GCG GGA CAC GCC GTG GGC 450 
Cys Pro Ala Gly His Ala Val Gly 

ACC CGT GGA GTG ACC AAG GCG GTG 495 
Thr Arg Gly Val Thr Lys Ala Val 
175 180 

CTA GAG ACA ACC ATG AGA TCC CCG 540 
Leu Glu Thr Thr Met Arg Ser Pro 
190 195 



GGG GTG CTC GTT GGC GGC GTC CTG GCT GCT CTG GCC GCG TAT TGC 585 
Gly Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys 
3 0 200 205 210 

CTG TCA ACA GGC TGC GTG GTC ATA GTG GGC AGG ATT GTC TTG TCC 63 0 
Leu Ser Thr Gly Cys Val Val He Val Gly Arg He Val Leu Ser 
215 220 225 

35 

GGG AAG CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC CAG GAG 67 5 
Gly Lys Pro Ala He He Pro Asp Arg Glu Val Leu Tyr Gin Glu 
230 235 240 
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TTC GAT GAG ATG GAA GAG AAG GAG AC A GAG 705 
Phe Asp Glu Met Glu Glu Lys Glu Thr Glu 
245 250 

5 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

1 0 (C) STRANDEDNESS: double 

(D) TOPOLOGY: double 

(ii) MOLECULE TYPE: cDNA 

15 GA TCA CCG GTC TAG ATCT 

T GGC CAG ATC TAGA 

(2) INFORMATION FOR SEQ ID NO:12: 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 
(A) NAME/KEY: 

30 

CCG GTC CGG AAG AAA AAG AGA CGC TAG C 
AG GCC TTC TTT TTC TCT GCG ATC G 



35 

(2) INFORMATION FOR SEQ ID NO:13: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 79 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 
(A) NAME/KEY: 

1 0 

CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC 'TAC CAG GAA TTC 
GT TAA TAT GGA CTG TCC CTC CAA GAG ATG GTC CTT AAG 

GAT GAG ATG GAA GAG TGC CGG AAG AAA AAG AGA CGC A 

1 5 CTA CTC TAC CTT CTC ACG GCC TTC TTT TTC TCT GCG TTC GA 

(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

2 0 (A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

2 5 (ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME/KEY: NS4A Active Mutant 

3 0 Gly Cys Val Val lie Val Gly Arg He Val Leu Ser Gly Lys 

5 10 

(2) INFORMATION FOR SEQ ID NO:15: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 



<WO 9635806 A 1 _[_> 



WO 96/35806 

(D) TOPOLOGY: linear 



47 
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(ii) MOLECULE TYPE: polypeptide 

5 (ix) FEATURE: 

(A) NAME/KEY: NS4A Active Mutant 

Cys Val Val lie Val Gly Arg lie Val Leu Ser Gly Lys 
5 10 

10 

(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 amino acids 

1 5 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

20 

(ix) FEATURE: 

(A) NAME/KEY: Soluble 5A/5B Substrate 

Asp Thr Glu Asp Val Val Cys Cys Ser Met Ser Tyr Thr Trp Thr 
25 5 10 15 

Gly Lys 

(2) INFORMATION FOR SEQ ID NO:17: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: polypeptide 
(ix) FEATURE: 
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(A) NAME /KEY: Mutant 5A/5B Substrate 



Asp Thr Glu Asp Val Val Ala Cys Ser Met Ser Tyr Thr Trp Thr 
5 10 15 

5 Gly 

(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 amino acids 
1 0 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



1 5 



30 



35 



(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME/KEY: Mutant Soluble 5A/5B Substrate 



Asp Thr Glu Asp Val Val Ala Cys Ser Met Ser Tyr Thr Trp Thr 
2 0 5 10 15 

Gly Lys 

2) INFORMATION FOR SEQ ID NO:19: 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME/KEY: Soluble 5A/5B Substrate 

Asp Thr Glu Asp Val Val Cys Cys Ser Met Ser Tyr Thr Trp Thr 
5 10 15 

Gly Lys Tyr 
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2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

1 0 (ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME/KEY: Soluble 5A/5B Substrate 

1 5 

Asp Thr Glu Asp Val Val Ala Cys Ser Met Ser Tyr Thr Trp Thr 

5 10 15 

Gly Lys Tyr 

2 0 2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

2 5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

3 0 (ix) FEATURE: 

(A) NAME /KEY: Soluble 4B/5A Substrate 

Trp He Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp Leu 

5 10 15 

3 5 Arg Asp He Trp Asp 

2) INFORMATION FOR SEQ ID NO:22: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

1 0 (A) NAME /KEY: histidine tag 

Met Arg Gly Ser His His His His His His Thr Asp Pro 

5 10 

15 2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

2 0 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

2 5 (ix) FEATURE: 

(A) NAME /KEY: hydrophilic tail 

Arg Lys Lys Lys Arg Arg Lys Leu Asn 

5 

30 

2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 

3 5 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME/KEY: hydrophilic tail 

5 

Lys Glu Thr Glu 

2) INFORMATION FOR SEQ ID NO:25: 

1 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

15 

(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME /KEY: hydrophilic tail 

20 

Trp lie Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp Leu 

5 10 15 

2 5 Arg Asp lie Trp Asp 

20 

(2) INFORMATION FOR SEQ ID NO:26: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 
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(A) NAME /KEY: NS4A Mutant 



GTG CTC GTT GGC GGC GTC CTG GCT GCT CTG GCC GCG TAT TGC CTG 45 

Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu 
5 1 5 10 15 

TCA ACA GGC TGC GTG GTC ATA GTG GGC AGG ATT GTC TTG TCC GGG 90 

Ser Thr Gly Cys Val Val lie Val Gly Arg lie Val Leu Ser Gly 
20 25 30 



1 0 



AAG CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC CAG GAG TTC 13 5 
Lys Pro Ala lie lie Pro Asp Arg Glu Val Leu Tyr Gin Glu Phe 
35 40 45 



1 5 GAT GAG ATG GAA GAG TGC 
Asp Glu Met Glu Glu Cys 
50 



(2) INFORMATION FOR SEQ ID NO:27: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

2 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

3 0 (A) NAME /KEY: pNB182A4AHT 



ATG AGA GGA TCG CAT CAC CAT CAC CAT CAC ACG GAT CCG CCC ATC 45 
Met Arg Gly Ser His His His His His His Thr Asp Pro Pro lie 
35 1 5 10 15 
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ACG GCG TAC GCC CAG CAG ACG AGA GGC CTC CTA GGG TGT ATA ATC 9 0 
Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly Cys lie lie 
20 25 30 

5 ACC AGC CTG ACT GGC CGG GAC AAA AAC CAA GTG GAG GGT GAG GTC 13 5 
Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu Gly Glu Val 
35 40 45 

CAG ATC GTG TCA ACT GCT ACC CAA ACC TTC CTG GCA ACG TGC ATC 18 0 
1 0 Gin lie Val Ser Thr Ala Thr Gin Thr Phe Leu Ala Thr Cys lie 

50 55 60 

AAT GGG GTA TGC TGG ACT GTC TAC CAC GGG GCC GGA ACG AGG ACC 22 5 
Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr 

1 5 65 70 75 

ATC GCA TCA CCC AAG GGT CCT GTC ATC CAG ATG TAT ACC AAT GTG 27 0 
lie Ala Ser Pro Lys Gly Pro Val lie Gin Met Tyr Thr Asn Val 
80 85 90 

20 

GAC CAA GAC CTT GTG GGC TGG CCC GCT CCT CAA GGT TCC CGC TCA 315 
Asp Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly Ser Arg Ser 
95 100 105 

2 5 TTG ACA CCC TGC ACC TGC GGC TCC TCG GAC CTT TAC CTG GTT ACG 3 60 

Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr 
110 115 120 

AGG CAC GCC GAC GTC ATT CCC GTG CGC CGG CGA GGT GAT AGC AGG 4 05 

3 0 Arg His Ala Asp Val lie Pro Val Arg Arg Arg Gly Asp Ser Arg 

125 130 135 

GGT AGC CTG CTT TCG CCC CGG CCC ATT TCC TAC CTA AAA GGC TCC 45 0 
Gly Ser Leu Leu Ser Pro Arg Pro lie Ser Tyr Leu Lys Gly Ser 
3 5 140 145 150 

TCG GGG GGT CCG CTG TTG TGC CCC GCG GGA CAC GCC GTG GGC CTA 49 5 
Ser Gly Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val Gly Leu 
155 160 165 
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TTC AGG GCC GCG GTG TGC ACC CGT GGA GTG ACC AAG GCG GTG GAC 54 0 
Phe Arg Ala Ala Val Cys Thr Arg Gly Val Thr Lys Ala Val Asp 
170 175 180 

TTT ATC CCT GTG GAG AAC CTA GAG ACA ACC ATG AGA TCC CCG GGG 585 
Phe lie Pro Val Glu Asn Leu Glu Thr Thr Met Arg Ser Pro Gly 
185 190 195 



1 0 GTG CTC GTT GGC GGC GTC CTG GCT GCT CTG GCC GCG TAT TGC CTG 63 0 
Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu 
200 205 210 

TCA ACA GGC TGC GTG GTC ATA GTG GGC AGG ATT GTC TTG TCC GGG 720 
1 5 Ser Thr Gly Cys Val Val lie Val Gly Arg lie Val Leu Ser Gly 

215 220 225 

AAG CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC CAG GAG TTC 7 65 
Lys Pro Ala . lie lie Pro Asp Arg Glu Val Leu Tyr Gin Glu Phe 
20 230 235 240 

GAT GAG ATG GAA GAG TGC CGG AAG AAA AAG AGA CGC AAG CTT AAT 810 
Asp Glu Met Glu Glu Cys Arg Lys Lys Lys Arg Arg Lys Leu Asn 
245 250 255 

25 

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 162 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

3 5 (ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: Native NS4A 

TCA ACA TGG GTG CTC GTT GGC GGC GTC CTG GCT GCT CTG GCC GCG 4 5 
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Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala 
1 5 10 15 

TAT TGC CTG TCA ACA GGC TGC GTG GTC ATA GTG GGC AGG ATT GTC 9 0 
5 Tyr Cys Leu Ser Thr Gly Cys Val Val lie Val Gly Arg lie Val 

20 25 30 

TTG TCC GGG AAG CCG GCA ATT ATA CCT GAC AGG GAG GTT CTC TAC 13 5 
Leu Ser Gly Lys Pro Ala lie lie Pro Asp Arg Glu Val Leu Tyr 
1 0 35 40 45 

CAG GAG TTC GAT GAG ATG GAA GAG TGC 
Gin Glu Phe Asp Glu Met Glu Glu Cys 
50 



15 



25 



(2) INFORMATION FOR SEQ ID NO:29: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 amino acid residues 
2 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME/KEY: Carboxl 33 mer of NS4A 



Cys Val Val lie Val Gly Arg He Val Leu Ser Gly Lys Pro Ala 
3 0 5 10 15 

He He Pro Asp Arg Glu Val Leu Tyr Gin Glu Phe Asp Glu Met 
20 25 30 



3 5 Glu Glu Cys 



(2) INFORMATION FOR SEQ ID NO:30: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acid residues 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

1 0 (A) NAME /KEY: Carboxl 33 mer of NS4A of HCV-BK strain 

Ser Val Val lie Val Gly Arg lie lie Leu Ser Gly Arg Pro Ala 
5 10 15 

1 5 lie Val Pro Asp Arg Glu Leu Leu Tyr Gin Glu Phe Asp Glu Met 

20 25 30 



Glu Glu Cys 
2 0 (2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acid residues 

(B) TYPE: nucleic acid 

2 5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

3 0 (ix) FEATURE: 

(A) NAME /KEY: Cytomegalovirus substrate 

Arg Gly Val Val Asn Ala Ser Ser Arg Leu Ala Tyr 

3 5 (2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 amino acid residues 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME /KEY: Cytomegalovirus substrate 

1 0 Ala Gly Val Val Ala Ser Ser Arg Leu Ala 



(2) INFORMATION FOR SEQ ID NO:33: 

1 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acid residues 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: polypeptide 

(ix) FEATURE: 

(A) NAME /KEY: Cytomegalovirus substrate 

25 

Gly Val Val Asn Ala Thr Cys Arg Leu Ala 
(2) INFORMATION FOR SEQ ID NO:34: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acid residues 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: polypeptide 
(ix) FEATURE: 



5 8 
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(A) NAME /KEY: Cytomegalovirus substrate 

Gly Val Gly Asn Ala Ser Cys Arg Leu Ala 

5 (2) INFORMATION FOR SEQ ID NO:35: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acid residues 

(B) TYPE: nucleic acid 

1 0 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: polypeptide 

1 5 (ix) FEATURE: 

(A) NAME/KEY: Cytomegalovirus substrate 



Gly Val Val Asn Gly Thr Cys Arg Leu Ala 
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WE CLAIM: 

1. A method for determining if a substrate is cleaved by an enzyme using 
surface plasmon resonance comprising: 

5 (a) placing the substrate and the protease together in solution in a. 

reaction vessel under conditions wherein the protease can cleave the 
substrate; 

(b) stopping the reaction; and 

(c) bringing the solution containing the protease and substrate into 
1 0 contact with a ligand bound to a sensor chip, wherein the ligand is able to 

bind to the substrate, and wherein the mass of the substrate versus the 
mass of the cleaved substrate is detected by surface plasmon resonance 
technology. 

15 2. The process of claim 1 wherein the substrate and the enzyme are from 
cytomegalovirus and the enzyme is a protease. 

3. The process of claim 1 wherein the substrate and the enzyme are from 
hepatitis C virus and the enzyme is an NS3 protease. 

20 

4. The method of claim 1 wherein the substrate has a larger moiety linked 
to said substrate so as to increase the mass of said substrate. 

5. The method of claim 4 wherein the moiety is a polypeptide or a protein. 

25 

6. The method of claim 5 wherein the protein is biotin covalently linked 
to said substrate. 

7. The method of claim 6 wherein streptavidin is bound to said biotin. 

30 

8. A method of determining if a test substance is a protease inhibitor to a 
substrate which the protease is able to cleave comprising: 

(a) placing the test substance in solution in a reaction vessel with 
3 5 the protease and the substrate; 

(b) bringing the solution containing the protease, substrate and test 
substance into contact with a ligand bound to a sensor chip, wherein the 
ligand is able to bind to the substrate, and wherein the mass of the 
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substrate versus the mass of the cleaved substrate is detected by surface 
plasrnon resonance technology, wherein if the substrate is cleaved as 
determined by its decrease in mass as detected by surface plasrnon 
resonance technology then the test substance is not a protease inhibitor 
5 and if the substrate is a protease inhibitor, then the substrate does not have 
a decrease in mass as is determined by surface plasrnon resonance. 

9. The method of claim 8 wherein the substrate has a larger moiety linked 
to said substrate so as to increase the mass of said substrate. 

0 

10. The method of claim 9 wherein the moiety is a polypeptide or a 
protein, 

11. The method of claim 10 wherein the protein is biotin covalently 
5 linked to said substrate. 

12. The method of claim 11 wherein streptavidin is bound to said biotin. 
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CA-A- 


2044254 


24-09-90 






DE-D- 


69023126 


23-11-95 






DE-T- 


69023126 


21-03-96 






EP-A- 


0464119 


08-01-92 






JP-T- 


4503758 


09-07-92 
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